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Abstract 



We suggest a new type of problem about distances in graphs and make 
several conjectures. As a first step towards proving them, we show that for 
sufficiently large values of n and k, a graph on n vertices that has no three 
vertices pairwise at distance k has at most (n — k + l) 2 /4 pairs of vertices 
at distance k. 

1 Introduction 

>: 

In Py, Bollobas and Tyomkyn determined the maximum number of paths of 
length k in a tree T on n vertices. Here we suggest an extension of this prob- 
ed \ lem to general graphs. 

The 'obvious' extension, counting paths of a given length in a graph G, has 
been studied since 1971, see, e.g., [H El lU El El El [HI CE2] and the references 
therein. On the other hand, counting paths of length k in trees can be interpreted 
as counting pairs of vertices at distance k. Therefore, a natural question to ask is 
the following. 

Question. For a graph G on n vertices, what is the maximum possible number of 
pairs of vertices at distance k ? 

To the best of our knowledge, this question has not been considered previously. 
Our aim in this paper is to formulate several conjectures and to prove one of them 
in the first non-trivial special case. 

For a graph G, define the distance-A; graph Gj. to be the graph with vertex 
set V(G) and {x, y} G E(Gk) if and only if x and y are at distance k in G, that 
is, the shortest path between x and y has length k. We call such vertices x and y 
k-neighbours and the pair {x,y} a k-distance. We call dc k {x) the k-degree of x. 
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Figure 1: A 5-broom for k = 8 and a 5-broom for k = 7. 



Observe that if H is an induced subgraph of G, then H 2 is a subgraph of G 2 - 
This need not be the case when k > 3. It is clear that Gk — Hk does not imply 
that G = H. If Gk — Hk-, then we say that G is k-isomorphic to H. 

We wish to maximise the number of edges in Gk over all graphs Gonn vertices. 
One attempt to construct a graph with many /c-neighbours would be to consider 
what we call t-brooms. For even k > 4 and for t > 2, define a t-broom to be a 
graph consisting of a central vertex v with t 'brooms' attached, each consisting of 
a path on (k — 2)/2 vertices with leaves attached to the ends opposite v. In this 
way, the leaves of different brooms will be at distance k. For odd k > 3, to define a 
t-broom, take a copy of and attach a broom to each vertex, adjusting the length 
of the path. (See Figure [TJ) As in Turan's theorem, the number of fc-distances 
in a t-broom will be maximised when the numbers of leaves in the brooms are as 
equal as possible. 

In [8J Bollobas and Tyomkyn proved that if G is a tree, then e(Gk) is maximal 
when G is a t-broom for some t. 

Theorem 1. Let n > k. If G is a tree on n vertices, then e(Gk) is maximal when 
G is a t-broom. If k is odd, then t = 2. If k is even, then t is within 1 of 

1 H n - 1 
4 + V 16 + fc-2' 

These results prompt us to make the following conjecture. 

Conjecture 2. Let k > 3. There exists h = h(k) such that if n > h(k), then 
e(Gk) is maximised over all G with \G\ = n when G is k-isomorphic to a t-broom 
for some t. 

For small values of n there exist better constructions. For example, if k = 3 
and n = 7, the 7-cycle has more 3-distances than any t-broom. 

We firmly believe Conjecture [2] to be true, but are unable to prove it. In this 
paper, we approach Conjecture |2] by placing a restriction on u(Gk), the clique 
number of Gk, which is the maximal number of vertices at pairwise distance k. 
We formulate the following natural analogue of Conjecture [2] under this condition. 

Conjecture 3. Let k > 3 and t > 2. There is a function h 2 : NxN-^N such that 
if n > h 2 (k,t), then e(Gk) is maximised over all G with \G\ = n and u(Gk) < t 
when G is k-isomorphic to a t-broom for some t. 
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In this paper, we shall discuss the case t = 2 of Conjecture El that is, the case 
when no three vertices of G are pairwise at distance k. Note that in this case 
the parity of k matters little, as the conjectured optimal example is just a path 
of length k — 2 with equally many leaves attached to each of its endvertices. The 
number of fc-distances in such a graph is simply \_(n — k + 1) 2 /4J. 

We prove Conjecture [3] for t = 2 and sufficiently large k. More precisely, we 
prove the following assertion. 

Theorem 4. There is a constant & and a function : N — > N such that for 
all k > k , all n > n (k) and all graphs G of order n with no three vertices 
pairwise at distance k, 



Moreover, if equality holds, then G is k-isomorphic to the double broom. 

We do not make any effort to determine k$ and exactly; on the contrary, 
we are rather generous about them. However, we conjecture that k = 2 and 
(n, k) = (7, 3) are the only exceptions to the optimality of the double-broom. 

Conjecture 5. In the setting of Theorem^ we can take k = 3, n (3) = 8 and 
no{k) = k + 1 otherwise. 

For k = 2, we can do a little better than the bound in Theorem HJ as the 
following construction shows. 

Example 6. Let X and Y be cliques on (n+l)/2 vertices each, with a vertex z in 
common. Take vertices x G X and y G Y . Remove the edges {x, z} and {y, z} and 
add the edge {x, y}; call the resulting graph G. Then G2, the distance-2 graph 
of G, is a complete bipartite graph with one edge subdivided, and thus has a total 
of (n - l) 2 /4 + 1 edges. 

We believe that for n > 5, a triangle- free G2 can have no more than (n— l) 2 /4+l 
edges. This is clearly true for n = 5, and a computer search verifies that it also 
holds for 6 < n < 11. However, we cannot prove that it is true in general. If this is 
indeed so, then it shows that for k = 2 the quantity (n — k + 1) 2 /4 is within 1 of the 
maximum. The same would hold for k = 3 and n = 7, when the aforementioned 
7-cycle wins by 1 over the double broom. 

In the general case, with no restriction on 07(^2), the maximum value of e(G2) 
is straightforward. 

Proposition 7. Let G be a graph on n vertices. Then 



Moreover, if equality holds, then G is a star. 

Proof. The result is clear for n = 3. We suppose that for some n > 3, the result 
holds for all graphs on at most n — 1 vertices. Let G be a graph on n vertices. We 
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may assume that G is connected. Otherwise, if G has I > 2 components of sizes 
Ci, . . . , Q, say, then, by hypothesis, 



<G 2 ) < W 
i=l ^ 



d — 1\ ( n — 1 



Observe that, setting d = diam(G), 

= e(G) + e(G 2 ) + ■■■ + e(G d ) 



Thus, 



r(G,)< [ n 2 ] -d(V). (Li) 



Since G is connected, e(G) > n—1, and hence eiGz) < ("2 )• Note that if equality 
holds, then G must be a tree. Moreover, in this case, by (11. ip . diam(G) = 2, so G 
must be a star. □ 



2 Preliminaries 

In this section, we shall prove a straightforward bound on e(Gk)- We shall then 
discuss possible ways of extending this result to the upper bound in Theorem HI 
We shall also discuss a useful property of spanning trees. 

Since Gk is triangle-free by our assumption, Mantel's theorem implies that 
e(Gk) < n 2 /A. In fact, we can do somewhat better by adapting a standard proof 
of Mantel's Theorem (see, e.g., j2]) to /c-distances. 

Lemma 8. If Gk is triangle- free, then 

e(G k ) < 

Proof. For a vertex x, let v{x) be the number of /c-neighbours of x, or equivalently, 
the degree of x in Gk- We may bound e(Gk) as follows: for each pair of vertices 
x, y at distance k, count the fc-neighbours of x and of y. Note that since x and 
y have no common /^-neighbours (otherwise there would be a triangle in Gk), we 
have v (x) + v{y) < n. In fact, we can claim that v{x) +v(y) < n — k + 1, since none 
of the h — 1 internal vertices on the shortest path between x and y are fc-neighbours 
of x or of y. Summing over all such pairs {x, y}, we obtain 

(n-k + l)e(G k )> J2 Hx) + v(v))- (2-1) 

{x,y}&E{G k ) 

Observe that for each x G V(G), the quantity v(x) appears v(x) times on the 
right-hand side of (12. ip . By the Cauchy-Schwarz inequality, we have 

*(*))= E u ^ 2 ^l( E ^)) =^( e (^)) 2 - ( 2 - 2 ) 
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It follows from (j2HJ and (E2D that 



e(G t ) < (2 .3) 

as claimed. □ 

The bound that we have just proved is about halfway between the trivial n 2 /A 
and the desired (n — k + l) 2 /4. There are two natural ways in which one could try 
to improve (12. 3ft . One is to try to find vertices that have no k- neighbours at all. We 
say that such a vertex is an interior vertex; otherwise, a vertex is called an exterior 
vertex. The inspiration for this terminology is as follows. If k < diam(G) < 2k — 1, 
then a vertex at or near the centre of the graph has no ^-neighbours, while a vertex 
that is far from the centre of the graph will have one or more fc-neighbours. If G 
has at least r interior vertices, then adapting the proof of Lemma [8] improves the 
bound in (12. 3p to 

e(G4 ) < <"- r ><;- t+1 > . (2.4) 

Thus, we are done if we can find at least k — 1 interior vertices (which holds in the 
case when G is the double-broom). So, we may assume that r < k — 1. 

The other way to improve (12.31) would be to find many pairs of vertices {u, v} 
such that many vertices z are fc-neighbours of neither u nor v. We say that a 
vertex v £ V is k -unaffiliated with a vertex u £ V if d(u, v ) ^ k. A vertex v is k- 
unaffiliated with a set U C V if it is ^-unaffiliated with each u £ U . Otherwise, we 
say that v is k- affiliated with U. Thus, an interior vertex is fc-unaffiliated with V. 
If G has r interior vertices and each {u,v} £ E(Gk) has p ^-unaffiliated vertices, 
then (12. 3p improves to 

/ r + p 

e(G t ) < <"-''f < V~7~' . (2.5) 

In particular, we are done if p > 2A; — r — 2, i.e., if every {"U,f} £ E(Gk) has at 
least — r — 1 /^-unaffiliated vertices other than those on the shortest path between 
u and v . 

In order to prove Theorem HI we shall show that for every pair of A> neighbours 
in G, p > 2k — r — 2. To do so, we shall need to study paths in G. In connection 
with this, we shall need the following easy result about the lengths of paths in a 
spanning tree. If P is a path, we write \P\ to denote the number of vertices in P. 

Lemma 9. Let r > 2 and let G be a graph on at least r + 1 vertices. If G has at 
most r interior vertices, then every spanning tree of G either contains no path of 
length at least r + 1 or contains a path of length 2k — r. 

Proof. Let T be a spanning tree of G such that T contains a path of length at 
least r + 1. Let P be a longest path in T and let u and v be its endpoints. Suppose 
that \P\ < 2k — r. Let x±, . . . , x r+ \ be the r + 1 central vertices of P, indexed 
consecutively in order of increasing distance from u. (If \P\ is even, then there are 
two choices for the r + 1 central vertices of P; choose one arbitrarily.) Suppose 
that some x,- t had a ^-neighbour in G, called y. Let Q be the path in T from X{ 
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to y. Since, for all x, y € G, d G (x,y) < d T (x,y), we know that \Q\ > k + 1. Let 
z be the furthest vertex from Xi at which P and Q coincide; by symmetry, we 
may assume that z is closer to v than Xi is. We shall construct a path in T that 
is longer than P, in contradiction to the assumption. Define P + to be the path 
formed by Q and the portion of P between u and Xj. Then 

\P+\ = \Q\ + d T (xi,Xi) + dr(u,xx) > (k + 1) + (i — 1) + d T {u,Xi). 

Because the r + 1 central vertices of P, we have 



x\) > 

from which we deduce that 



\P\ - 1 - drOl^r+l} 



> 



IPI - r - 2 



, , , \P\-r-2 \P\ + 2k-r , , 

P+ > A; + i + J—! > — > P . 

2 - 2 11 

Thus, no can have a fc-neighbour in G, which means that all of the X{ must be 
interior vertices. □ 

The following concepts will play key roles in the proof of Theorem HI Let v and 
w be two vertices at distance k. Recall that a geodesic is a shortest path between 
two vertices in a graph. Define the vw-path P to be a shortest path between v 
and w. Order the vertices of G and conduct a breadth-first search starting from 
v such that the resulting tree T v contains P. We define the v-path P v to be the 
longest path in T v . (If there is more than one longest path in T v , then we choose 
one arbitrarily.) Similarly, we define T w to be breadth- first tree with respect to w 
containing P, and the w-path P w to be the longest path in T w . 

Consider the breadth-first tree T v and the v-path P v . Let x and y be the 
endpoints of P v . Moving along P v from x to y, or in fact along any path in T v , 
the distance from v will first decrease, then increase — this is a fundamental 
property of breadth-first search trees, for the depth of a vertex w in such a tree 
equals da(v,w). Thus, P v can be divided into two geodesies. Let z be a vertex 
of P v at minimal distance from v. Let P\ denote the portion of P v between x and 
z and P2 the portion of P v between z and y; one of these may be empty. 

By our assumption that r < k — 1, if v and w are fc-neighbours, then T v 
must contain a path of length at least r + 1. Hence, by Lemma P v contains at 
least 2k — r vertices. Note also that at most two vertices on P v can be at distance k 
from v. 



3 Proof of Theorem 3 

From now on, we shall assume that G satisfies e{Gk) > (n — k + l) 2 /4. As noted 
above, we shall also assume that r < k — 1. 

Let us briefly discuss the proof of Theorem HI First, for k and n large enough, 
we shall prove a simple condition under which a pair of /c-neighbours must have at 
least 2k — r — 2 /^-unaffiliated vertices. We shall deduce from this that each pair of 
^-neighbours in G has almost enough ^-unaffiliated vertices to achieve the desired 
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bound on e(G f / c ). Second, we shall deduce our key lemma, which says that if some 
pair of ^-neighbours {v,w} does not have enough ^-unaffiliated vertices, then all 
geodesies in G must have only a few vertices apart from each of P v and P w . Third, 
we shall show that in this case, every other /c-neighbour of v is at distance o(k) from 
w, and vice- versa. Moreover, for some 8 = o(l), we shall show that all vertices that 
are at distance at least 8k from both v and w have very few ^-neighbours, which, 
by Turan's theorem, will contradict our assumption that e(Gj.) > (n — k + l) 2 /4. 
Finally, we shall deduce that in this case, G has at most as many fc-distances as 
the double broom, which will imply that e(Gk) = (n — k + l) 2 /4. Moreover, we 
shall show that in this case G is /c-isomorphic to the double broom. 

Lemma 10. For every e > there exists a constant K(e) such that for all k > 
K(e), if some geodesic in G contains ek vertices that are k- affiliated with either 
v G E(Gk) or w G E(Gk), then we can find Ik vertices that are k -unaffiliated with 
both v and w. 

Proof. Suppose that Q is a geodesic and that ek vertices of Q are fc-affiliated with 
either v or w. By the pigeonhole principle, we can assume that m > ek/2 of them 
are at distance k from w. Index them consecutively by x±, . . . , x m . For 1 < i < m, 
let Qi denote a shortest path in G from X{ to w. Let us 7-colour the vertices of Q 
so that X{ is coloured with colour j if i = j mod 7 and choose a colour (red, say) 
that belongs to at least m/7 of the Xi. For each red x%, move two steps along the 
path Qi. In this way, we obtain m' > m/7 vertices at distance k — 2 from w and 
at distance at least 7 — 2 — 2 = 3 from each other; let us call them y\, y\ 1 . . . , y^,. 
If a vertex y\ is not at distance k from v, set z\ = yj. If y\ is at distance k from 
v, take z\ to be the vertex obtained by moving from y\ one step towards v (see 
Figure [2]). Since the y\ were at pairwise distance at least 3, all of the z\ will be 
distinct vertices at distance between k — 1 and k — 3 from w and not at distance k 
from v, i.e., they are ^-unaffiliated with both v and w. 

Similarly, by 13-colouring Q, defining 'red' to be the largest colour class, and 
moving 5 steps along the paths Qi, we find m" > m/13 vertices yf, y|, . . . , y^,, 
at distance k — 5 from u> and at distance at least 13 — 5 — 5 = 3 from each 
other. This gives rise to m/13 distinct fc-unaffiliated vertices zf at distance between 
k — 4 and k — 6 from w and not at distance k from i>, i.e., the zf are disjoint from 
the previously constructed z\. 

Repeating this procedure for all Q£ + 1-colourings up to i = [_m/6j , we obtain 
a total of at least 



Lemma [TO] has the following important corollary. 

Corollary 11. Let {v,w} G Gk- Let P v and P w be geodesies as defined above. 
Then either \P V \ —ek > (2k — r) —ek vertices on P v are k -unaffiliated with v andw 
or we can find 2k vertices that are k -unaffiliated with v and w elsewhere. The same 




/c-unaffiliated vertices, which is greater than 2k for k large enough. 



□ 



is true of P x 



□ 
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Corollary [TT] and equation (12. 5p have the following immediate consequence, 
which is an approximate version of the bound in Theorem HI 



Corollary 12. For every e > there is a constant K(e) such that for allk > K(e), 
if Gk is triangle-free then 

e(G k ) < ± 1 4 } } ■ □ 

Lemma [10] and Corollary [11] also give us very useful information about the 
structure of the graph. The following lemma is the main tool in the remainder of 
the proof of Theorem [H 

Lemma 13. Let e > and let k > K(e), where K(e) is as in LemmalTU. Suppose 
that v and w are k -neighbours in a graph G that have fewer than 2 k — r — 2 
k -unaffiliated vertices. Then any geodesic in G contains fewer than 2ek vertices 
disjoint from P v , and similarly for P w . 

Proof. Suppose that Q is a geodesic in G with at least 2ek vertices disjoint from P v . 
First, if ek of these vertices are /c-unaffiliated with v and w, then, by Corollary [TT| 
v and w have at least 2k > 2k — r — 2 /^-unaffiliated vertices. Second, if not, then 
Q\P V must contain at least ek vertices that are fc-affiliated with v and w. In this 
case, by Lemma [TO] we again obtain at least 2k > 2k — r — 2 ^-unaffiliated vertices. 
In either case, we reach a contradiction. □ 



Let P denote the vw-paih and let P' denote the vw'-path, as defined in Sec- 
tion [2] Recall that the f-path P v splits into two geodesies, which we call Pi and 
P 2 , along each of which the distance from v is strictly monotone. Similarly, the 
lu-path P w splits into two geodesies, which we call P3 and P4. Note that since P v 
was defined on a tree T v that contains P, we have that P v fl P is an interval of P, 
lying entirely in Pi or in P2, and analogously for P w fl P. For the remainder of the 
proof, without loss of generality, let P v fl P C Pi and let P w fl P C P 3 . 

As was already mentioned in Section [2] by Lemma [S] we are done if any pair of 
/c-neighbours has 2k — r — 2 fc-unaffiliated vertices. So let us assume for the sake 
of contradiction that some vertices v and w at distance k have fewer than that 
many ^-unaffiliated vertices. Let w' be another fc-neighbour of v. How large can 
the distance between w and w' be? The following lemma shows that this distances 
is either close to 2k or close to 0. 

Lemma 14. Let e > and let k > K{e), where K(e) is as in LemmalTU Let v 
and w be k-neighbours with fewer than 2k — r — 2 k -unaffiliated vertices. Let w' be 
another k-neighbour of v. Then either d(w, w') = (2 — o(l))k or d(w, w') = o(k) . 

Proof. Recall that P v and P w denote the f-path and the w-path, respectively. By 
Corollary [11] P v contains at least \P V \ — ek > 2k — r — ek /^-unaffiliated vertices. 
Since, by assumption, v and w have at most 2k — r — 2 /c-unaffiliated vertices, by 
Lemma [TBI we must have 

IPJ < 2k-r + ek. 
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Figure 3: When vl G Pi, all ^-neighbours of v are close together. The dashed 
segments represent the path P\. 

Observe that G contains at most ek vertices that are /^-unaffiliated with v and w 
and are not on P v , or else we are done by Corollary [TTJ The same assertions as 
above hold for P w in place of P v . By Lemma fT3| for the vertex sets of the paths, 

|P n P v \ > (1 - 2e)k and \P n P w \ > (1 - 2e)/c. (3.1) 

Let u be the point furthest from v at which P and P„ coincide. By hypothesis, 
u G Pi. Similarly, let w' be the furthest point from v at which P' and P„ coincide. 
By Lemma [TBI d(u,w) < 2ek and d(u',w') < 2ek. It follows that 

(1 - 2e)A; < d(v, u), d(v, vl) < k. (3.2) 

Now we consider two cases: when vl G Pi and when vl G Pj. 

Suppose that n' lies on P x . Suppose first that u is closer to v than it' is (see 
Figure [3]). Then d(v,w') = d(v,w) < 2ek, thus, 

d(w, w') < d(w, u) + d(v, w') < Ask. 

If vl is closer to v than u is, then by a similar argument, d{v! ', w) = d(v', w') < 2ek, 
and so 

d(w, w') < d(w, vl) + d(u', w') < 4ek. 

If, however, v! G P2, then d(w, w') depends on the length of P w \ P. Because 
v! G P 2 , we have |P 2 | > (1 - 2e)k. Since |P X | + |P 2 | = |P„| + 1 < 2k + efc (and 
similarly for | P3 1 , |Pj| and |P«,|), we have 

(1 - 2e)k < \Pi\ < (1 + 3e)k for i = 1, 2. (3.3) 

Now we consider the geodesies P3 and P4 that comprise P w . We have assumed 
that P3 contains P^, fl P. Then, because P4 fl P = 0, we have Pi fl P4 C P x \ P, 
hence, 

I Pi n P 4 | < I Pi \ P| < (1 + 3e)k - (1 - 2e)k = 5ek. 
Thus, by Lemma [TBI 

|P 4 \ p 2 \ = |p 4 n Pi| + |P 4 \ P u | < 5ek + 2ek = 7ek. 
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w' 

Figure 4: When u' G P 2 and \P±\ is large, all fc-neighbours of v are again close 
together. The dashed segments represent the path P4. The vertices a and b denote 
the endpoints of P 2 . 

Now we shall show that if |P 4 | is at all large, then d(w, w') = o(k), while if |P 4 | 
is very small, then d(w , w') = 2k — o(k). 

Case 1: Suppose first that \P&\ > 7ek. Then, because |Pi \ P 2 | < 7ek, we have 
P2 H P4 7^ 0. Let t denote the vertex at which P4 meets P. Then by Lemma |T2l 
we have d(t, w) < 2ek. Let q be the vertex of P 2 D P4 that is closest to w (see 
Figure Sj). Then d(q,w) < |P 4 \ P 2 | + d(t,w) < 9sk. Then 

k + 9ek > d(q,v) > k - 9ek. 

Now we bound d(q, u'). The bound depends on the location of q relative to u' . 
If q is farther away from v than u' is, then by (13. 3p . 

d(q, v) < |P 2 | - d(q, u') < (1 + 3e)A; - (1 - 2e)A; = 5eA;. 

If, however, q is closer to v than w' is, then the fact that d(q, v) > k — 9ek implies 
that d(q,u') < d(q,w') < 9ek. Thus, d(q,u') < 9ek and d(q,w') < llek. We 
therefore have 

d(w, w') < d(w, q) + d(q, w') < 9ek + llek = 20ek, 

which completes the proof of Case 1. 

Case 2: Suppose instead that |P 4 | < 7ek. Then, by (13. 1 p and Lemma [T3l we 
have 

\P 2 C]P 3 \ = |P 2 | - |P 2 \PJ - |P 2 nP 4 | > (k - 2ek) - 2ek - 7ek = k - llek. (3.4) 

Since P 3 contains P w H P and P 2 is edge-disjoint from P, it follows from (13. ip 
and (pS]) that 

|P 2 nP 3 | + |PHP 3 | > 2fc- 13efc. (3.5) 

Let q' be the vertex of P 2 D P3 that is furthest away from w (see Figure [5]). Then, 
by ([22]) and (133|) . 

2k + 3ek > d{q',v) +d(v,w) > d(q',w) > |P 2 HP 3 | + |PnP 3 | > 2k - 13ek. (3.6) 
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Figure 5: When vl G P 2 and \P±\ is small, all fc-neighbours of v are at distance (2 — 
o(l)) k from one another. The dashed segments represent the portion of the path P3 
that is disjoint from P. The vertices a and b denote the endpoints of P 2 . 

Also, it follows from Q and that 

k + 3ek > \P 2 \ > d(q', v) > \P 2 n P 3 | > k - llek. 

It follows from this and ( 13. 2 p that d(q', u') < l?>ek and therefore that d(q', w') < 
15ek. We obtain from this and (I3.6P that 

d(w, w') > d(q', w) - d{q', w') > 2k - 28ek. 

This proves the lemma. □ 

We shall now show that the assumption that d(w,w') = (2 — o(l))k for some 
w' leads to a contradiction. Let 8 = 0{e) = o(l). For v G V(G), define the cluster 
of v to be the set C v of vertices at distance at most 8k from v. 

Lemma 15. Fix e > 0. For k and n large enough, let G be a graph on n vertices 
and let v and w be k-neighbours with fewer than 2k — r — 2 k -unaffiliated vertices. 
Then every k-neighbour of v is at distance 0(ek) = o(k) from w, and vice versa. 

Proof. Suppose that w' 7^ w is a ^-neighbour of v such that d(w, w') = (2 — o(l))k. 
In this case, our graph G is 'flat', i.e., it contains a geodesic P 3 of length (2 — o(l))k 
and every vertex is o(k) away from P3. The vertex v lies close to the centre of P3, 
whereas w and w' lie near the opposite ends: every fc-neighbour of v lies within o(k) 
of either w or w' . Since there are no vertices at distance (2 — o(l))k from v, 
switching v and w in the statement of Lemma yields that all fc-neighbours of w 
are close to v. Since, by assumption, v and w have fewer than 2k ^-unaffiliated 
vertices, all but at most 2k vertices are contained in one of the clusters C v , C w 
and C w i. Every vertex on P3 not lying within 28k of either v, w or w' cannot have 
a A;-neighbour in any of these clusters. Therefore any such vertex has at most 2k k- 
neighbours. So, we have c = (2 — o(l))k vertices of fc-degree at most 2k. Applying 
Turan's theorem to the remaining vertices, we obtain 

e(Gi)£ (!i_£) 2 + M< (!i4±i) 2 , 

provided that n is sufficiently large compared to k, contradicting our hypothesis 
that e(Gfc) > (n - k + l) 2 /4. □ 
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Figure 6: The vertices of Si, shown in grey, are just outside of C v , and so have no 
fc-neighbours in either C v or C w . 

Since all but at most 2k of the vertices in G are at distance k from either v 
or w, we can conclude that all but at most 2k vertices lie either in C v , that is, 
within distance o(k) of v, or in C w . This is a fairly strong structural property of G 
and from here it is a short step to completing the proof of Theorem HI 

Proof of Theorem H Observe that by the definition of 5, every vertex on the vw- 
path P that is at distance more than 5k from both v and w cannot have a k- 
neighbour in either cluster. We shall use these vertices, together with a small 
set that we shall now construct, to produce a contradiction as in the proof of 
Lemma [Jj)J 

We shall now show that IP2I must be very small and that |Pi| cannot be much 
larger than k. It will then follow that \P V \ cannot be much larger than k, either. 

If IP2I > 105k, then P2 contains 45k vertices at distance between 25k and 65k 
from v. By Lemma [131 a ^ least 45k — 2ek > 35k of these vertices are contained 
either in P 3 or in P 4 . We shall consider two cases: when at least 5k of these vertices 
are in P 3 and when at least 25k of them are in P 4 . We shall show below that either 
case produces a contradiction as in the proof of Lemma [T5J Suppose first that 5k 
of them are in P3. Let Si = {x G P3 : 25k < d(v,x) < 65k}, let y be the closest 
vertex of P3 to v and let s G P2 be such that d(v,s) = 25k (see Figure [6]). By 
Lemma [TBI and the triangle inequality, we have d(y, s) > 25k — 2ek. So, 



which means that the vertices of Si are all at distance at least k + 5k from w and 
therefore have no ^-neighbours in the clusters. 

Suppose instead that 25k of them are in P 4 . We shall show that this implies 
that |P 4 | ~ k. Let x be the closest vertex of P 4 to w and let z be the point at 
which P 4 meets Pj. Let a be the closest point of P2 to v (see Figure [7j). We may 
assume that 



d(w, s) > k — 2ek + 25k - 2ek > k + 5k, 



35k < 45k - 2ek < d(a, z) < d(v, z) < 45k. 



Then, because a, x G P, 



P 4 | > d(x, z) > d(a, x) — d(a, z) > k — 4ek — 45k. 



(3.7) 



Also, since \P W \ < (2 + e)k and |P 3 | > (1 — 2e)k, it follows that 



P 4 | < (l + 3e)k. 



(3.8) 
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Figure 7: The vertices of S2, shown in grey, have no ^-neighbours in either cluster. 



Then |P 4 | ~ k. Let S2 denote the 25k central vertices of P4. Observe that if q G S%, 
then both d(q, w) and d(q, v) are bounded away from both and k. Indeed, after 
a bit of calculation, it follows from (13. 7p and ( 13. 8 p that 

k-4ek-45k ^ . (l + 3e)k 
< d(g, to) < — — + 5k 



and that 



k - 4ek - 45k ^ ,, s ^ . ri (1 + 3e)fc 
35A; H 5fc < d(g, v) < 45k + - — + 5k. 



Together with the aforementioned vertices on P, in each case we obtain c > 
k + 5k vertices that are not in either cluster and whose /c-degree is at most 2k. By 
applying Turan's theorem to these vertices as in the proof of Lemma fTS] we obtain 
a contradiction for large enough n. Thus, |P 2 | < 105k. 

Similarly, we claim that |Pi| < k + 105k. If not, then Pi contains at least 35k 
vertices that are at distance between k + 25k and k + 55k from v, and at distance 
between 25k — 2ek and 55k + 2ek from w, and we obtain a contradiction as above. 

We therefore have 

\P V \ < k + 205k. 

Recall that by Lemma [9] we have \P V \ > 2k — r. If 2k — r > k + 205k, then 
we have a contradiction, which means that r > k — 1, that is, that G has at 
least fc — 1 interior vertices. Hence, by (12 .4p . we have e(Gk) = (n — k + l) 2 /4. Let 
/ denote the set of interior vertices of G, and recall that by definition an interior 
vertex is isolated in G^. Because e(Gk) is maximal, it follows from ( 12 .5p that for 
all {x,y} G E(Gk), the only vertices that are ^-unaffiliated with both x and y are 
those on the (unique) shortest path between x and y. Hence, all of these vertices 
must be in /. It follows that Gk \ I is a complete bipartite graph with balanced 
parts, which means that G is ^-isomorphic to the double broom, as claimed. 

Otherwise, for k and n large enough, if any geodesic outside of P has length 
more than 205k + 4ek < 215k, then it has at least 2ek vertices disjoint from P v , 
and we are done by Lemma [T3j 

Let m be the midpoint of P, or one of the midpoints if k is odd. Suppose that 
there exist /c-neighbours x and y in G such that every fc-geodesic between x and 
y misses m. Let Q be such a geodesic. Then it must miss either the path P vm 
between v and m or the path P wm between w and m. In either case, Q will contain 
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a geodesic of length at least k/4 disjoint from P, a contradiction. Hence, every 
pair of fc-neighbours is connected by a geodesic containing m. 

It follows that G is /^-isomorphic to T m , the breadth-first tree with respect 
to m. By Theorem [TJ T m has at most as many ^-distances as the double broom 
does. Thus, e(G) = (n — k + l) 2 /4 and G is fc-isomorphic to the double broom. 
The proof of Theorem H] is complete. □ 

4 Discussion 

We believe that Conjecture [3] may be susceptible to a similar approach to the one 
above. In Section [21 we obtained our first non-trivial bound, Lemma El by adapt- 
ing a proof of Mantel's theorem, which is the simplest case of Turan's theorem. 
Unfortunately we were not able find a straightforward generalisation of this ap- 
proach to i^t + i-free distance-/;; graphs when t > 3. A possible solution might be to 
adapt a proof of Turan's theorem that works for all t. However, it seems difficult 
to generalise Lemma [9] for values of t greater than 2. 

We also note that in [10], Csikvari asked a similar question about maximising 
or minimising the number of (closed) walks of length k in a connected graph G 
on n vertices and m edges. In the same paper Csikvari settled the case of closed 
walks and m — n — 1, that is, when G is a tree. The answer for general walks on 
trees was given in [8], but the general case remains open. 
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